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earned patent term adjustment. See 37 CFR 1.704(b). 
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2a)KI This action is FINAL. 2b)D This action is non-final. 

3) n Since this application is in condition for allowance except for formal matters, prosecution as to the merits is 

closed in accordance with the practice under Ex parte Quayle, 1935 CD. 11. 453 O.G. 213. 

Disposition of Claims 

4) ^ Claim{s) 1^8 is/are pending in the application. 

4a) Of the above claim(s) is/are withdrawn from consideration. 

5) n Claim(s) is/are allowed. 

6) ^ Claim(s) IsS is/are rejected. 
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1 1) 0 The oath or declaration is objected to by the Examiner. Note the attached Office Action or forni PTO-152. 

Priority under 35 U.S.C. § 119 

12) 0 Acknowledgment is made of a claim for foreign priority under 35 U.S.C. § 119(a)-(d) or (f). 
a)n All b)n Some * c)^ None of: 

1 .□ Certified copies of the priority documents have been received. 

2. n Certified copies of the priority documents have been received in Application No. . 

3. n Copies of the certified copies of the priority documents have been received in this National Stage 

application from the International Bureau (PCT Rule 17.2(a)). 
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DETAILED ACTION 



Claim Rejections - 35 USC § 103 



1. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set forth in 
section 102 of this title, if the differences between the subject matter sought to be patented and the prior art are 
such that the subject matter as a whole would have been obvious at the time the invention was made to a person 
having ordinary skill in the art to which said subject matter pertains. Patentability shall not be negatived by the 
manner in which the invention was made. 



2. Claims 1 and 7 are rejected under 35 U.S.C. 103(a) as being unpatentable over Shubha et 

al ("AppHcation of the Wavelet ") in view of France et al ("Acoustical Properties of Speech 

as Indicators of Depression 

As per claim 1, Shubha et al ("Application of the Wavelet ") teaches: 



nar 



A. setting an analysis window to a selected sample 
set length of 512, where the particular sample is 
Identified as the Klh sample 

B. reading the Kth sample 

C. computing wavelet transforms of such Kth 
sample for scales in powers of 2 running from the 
1st power to the 5th 

D. storing the signal energy value as computed for 
each scale 



E. checking to determine whether the Kth sample 
is th6 last of the sample set and if additional 
samples remain, repeating steps "b" through "d" 

F. setting the median energy distribution at the 
scale for 2 to the 4th power as a threshold 



Setting window length to L ms. (page 918, 2 
column, last paragraph and FIG. 1) 



(2™^ step, FIG. 1) 

computing DyWt for scales in powers from 3rd to 5^ 
(page 919, 1*^* column, 3"* paragraph) 



inherently part of the algorithm disclosed in Fig. 1, 
since these values must be stored in computer 
memory for further comparisons. 

See Fig. 1, algorithm iterates through all segments. 



Threshold is set as 2 to 4"^ (page 919, second 
column, 2™* paragraph) 
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G. successively for each sample comparing the 
energy across the scales 



J. if the segment energy at the 2 to the 4th power 
scale exceeds the threshold, classifying the 
segment as voiced otherwise classifying it as 
silence. 



(See FIG .1 and page 919, 1 column, last 
paragraph) - "... in addition to checking whether the 
local maxima in DyWT correlates across two 
scales." 



Page 919. 2™* column. 2"** paragraph. 



Shubha et al, do not disclose: 



A. setting an analysis window to a selected sample set length of 512 

C. computing wavelet transforms of such Kth sample for scales in powers of 2 running from the 1st 
power to the 5^^ 

H. if the maximum energy is at the scale for 2 to the 1st power, identifying the segment as unvoiced and 
proceeding to the next succeeding sample 

I. if the segment maximum energy is at one of the scales of 2 to the 2nd power through 2 to the 5th 
power, identifying the segment as being either voiced or silence 



Regarding step A, it would have been obvious to one of ordinary skill in the art at 
the time the invention was made to modify Shubha et al. to use sample set length of 
512. Applicant has not disclosed whether any specific set length provides an advantage, 
is used for a particular purpose or solves a stated problem. One of ordinary skill in the 
art. furthennore. would have expected Shubha et al. to perform equally well with other 
sample set lengths because varying sample set length in Shubna et al. would only 
change processing requirements of the system, but would not affect the essence of 
Shubna et al's invention. 

Regarding steps C. H and I, Shubna et al. do emphasize that voiced speech is 
usually present at 3^^ to 5*^ scales, while unvoiced speech is only present at lower 
scales (p. 919. Col. 1 , last paragraph). Since Shubna et al. do not try to accomplish full 
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voiced/unvoiced/silence determination, they only attempt to find voiced parameters at 
the higher scales (3"^ -5"^). However, the above disclosure is sufficient to deduce that 
unvoiced frequencies exist at scales in powers of 2 running from 1^^ power to 2"*^ power. 
In addition, one of ordinary skill in the art could deduce that since the voiced segments 
exist at or above 2 to the 4^ power (voice) and unvoiced segments exist at or below 2 to 
the 2"^ power, then by process of elimination, signals falling in between these 
thresholds must represent silent periods. Finally, Applicant's specification and other 
claims indicate that the disclosed invention uses only voiced segments and discards the 
classified silence/unvoiced segments, so there does not appear real utility from further 
distinguishing within the unvoiced/silence category. 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time the invention was made to modify Shubna et al. to perfonm 
voiced/unvoiced/silence computation based on the principles disclosed by Shubna et al. 
This would allow the system to distinguish between silence, voiced and unvoiced 
segments in order to remove silence/unvoiced segments which are not useful for 
fundamental frequency (pitch) calculations (p. 919, Col. 1 , last paragraph) (as it is well- 
known in the art. unvoiced speech is not periodic and thus is not useful for pitch 
estimation). 

Shubha et al does not explicitly teach using these voicing parameter estimations to make 
a determination regarding near-term suicidal risk, however, France et al teaches using such 
estimations for determining suicidal groups (Table 8). Therefore, it would have been obvious to 
one of ordinary skill in the art of voice recognition at the time the invention was made to modify 



Application/Control Number: 09/97 1 ,777 Page 5 

Art Unit: 2655 

al to be used for suicidal risk evaluation because it would advantageously warn of impending 
suicide versus depression (France et al ("Acoustical Properties of Speech as Indicators of 
Depression abstract). 



3. Claims 2,3,6, and 8 are rejected under 35 U.S.C. 103(a) as being unpatentable over the 

combination of Shubha et al ("Application of the Wavelet ") in view of Parson ("Voice and 

Speech Processing") in further view of France et al ("Acoustical Properties of Speech as 
hidicators of Depression 

As per claims 2,6, Shubha et al ("Application of the Wavelet ") teaches: 



A. setting an analysis window to a selected sample 
set length of 512 where the particular sample is 
identified as the Kth sample 

B. computing the wavelet transform for the sample 
set at scale 2 to the 4th power, with a scale factor 
defined by the quotient of the wavelet center 
frequency at level 0 and the desired center 
frequency 

C. selecting two consecutive segments of the vocal 
signal of such person which are voiced segments 
and generating separate pulse trains in which the 
heights of the pulses correspond to amplitude of 
positive and negative peaks of the wavelet 
transformed speech signal 

D. thresholding the segments of the vocal signal to 
discard peaks corresponding to possible unvoiced 
samples 



nd 



Setting window length to L ms. (page 918, 2 
column, last paragraph and FIG. 1) 



Threshold is set as 2 to 4 (page 919, second 



column. 2 paragraph) 



(See FIG .1 and page 919, 1^ column, last 
paragraph) - "... in addition to checking whether the 
local maxima in DyWT correlates across two 
scales." 



Only voiced samples are used for pitch estimation 
(see FIG. 1, "segment is unvoiced, set pitch period 
to 0" block) 



E. V. taking the difference between two 
consecutive prominent pulses as the duration for 
the glottal cycle 



(page 919. Col. 1. 1*^ paragraph) 
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Shubha et al. do not disclose: 
A. setting an analysis window to a selected sample set length of 512 

E. computing a fundamental period over the entirely of each of the two segments by: 

i. finding the location of the first peak of the autocorrelation of the smoothed spectrum to the right 
of the zero lag component 

ii. detecting a starting pulse exhibiting the property of being larger than both the pulse 
immediately preceding and immediately following such pulse and being greater than 50% of the global 
maximum of the pulse sequence 

iii. locating following prominent pulses as detected in the neighborhood of expected locations 
determined by the peak of the autocorrelation sequence 

iv. selecting, between two sequences of positive and negative peaks, the peak having the largest 
magnitude 

and V. taking the difference between two consecutive prominent pulses as the duration for the 
glottal cycle 

P determining period-to-period fluctuation of fundamental frequency as the inverse of said glottal cycle 
for said two consecutive prominent pulses. 



Regarding step A, it would have been obvious to one of ordinary skill in the art at 
the time the invention was made to modify Shubha et al. to use sample set length of 
512. Applicant has not disclosed whether any specific set length provides an advantage, 
is used for a particular purpose or solves a stated problem. One of ordinary skill in the 
art, furthermore, would have expected Shubha et al. to perform equally well with other 
sample set lengths because varying sample set length in Shubha et al. would only 
change processing requirements of the system, but would not affect the essence of 
Shubha et ars invention. 
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Regarding steps E and F, Shubna et al. disclose computing the fundamental 
frequency as the inverse of time inten/al between local peaks (Shubna et al., page 919, 
Col. 1. 1^^ paragraph.) The method of finding the peaks using harmonic-peak-based 
detection is well-known in the art. For example, "Voice and Speech Processing" by 
Thomas Parson describes this method on pages 205-206. Firstly, Parson teaches that 
filtered (smoothed) autocorrelation of signal spectrum points to the approximate location 
of Fo (pages 198-199). Parson also teaches that "harmonic peaks occur at integer 
multiples of the pitch frequency (Fo)" and that "the differences in peak frequencies are 
integer multiples of the pitch frequency." As a result, harmonic-peak-based detection 
proceeds by searching for peaks in the range of the estimated peach (from 
autoconrelation) and fine-tuning the results by finding the maximum peaks within these 
regions (see the discussion of several different methods of pages 205-206. such as 
Snow and Hughes). 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time the invention was made to modify Shubna et al. to compute the fundamental 
frequency using a well-known method of harmonic-peak-detection (see Parson's book) 
in order to fine-tune the selection of peaks and thus establish the best fundamental 
frequency estimate. 

Steps A-J represent the determination of the voice segments (see claims 1,7). Steps K-0 
represent pitch estimation which requires voice samples (as required by step L). Therefore, steps 
A-J are rejected for the same reasons as steps A-J in claims 1(7). Steps K-0 are rejected for the 
same reasons as steps B-F in claim 2(6). 
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The combination of Shubha et al ("Application of the Wavelet ") in view of Parson 

("Voice and Speech Processing") does not explicitly teach using these voicing parameter 
estimations to make a determination regarding near-term suicidal risk, however, France et al 

("Acoustical Properties of Speech as Indicators of Depression ") teaches using such 

estimations for determining suicidal groups (Table 8). Therefore, it would have been obvious to 
one of ordinary skill in the art of voice recognition at the time the invention was made to modify 

the combination of Shubha et al ("Application of the Wavelet ") in view of Parson ("Voice 

and Speech Processing") to be used for suicidal risk evaluation because it would advantageously 
warn of impending suicide versus depression ( France et al ("Acoustical Properties of Speech as 
Indicators of Depression "), abstract). 



4. Claims 4 and 5 are rejected under 35 U.S.C. 103(a) as being unpatentable over France et 

al ("Acoustical Properties of Speech as Indicators of Depression ") . 

As per claims 4,5, France et al teaches: 



A method for assessing near-term suicidal risk 
through voice analysis independently of verbal 
content of the voice, comprising: 

eliciting a voice sample from a person to be 
evaluated for near- term suicidal risk and 
converting said sample into electronically 
processable signal form 

time-wise dividing said signal into segments 
according to whether the person was silent, 
speaking voiced words or making unintelligible 
unvoiced sounds 

if there are two consecutive voiced segments, 
measuring fundamental frequency for each of said 
two segments 

(partially) comparing the difference in measured 
fundamental frequency to fundamental frequency 
difference data (not disclosed) for known near-term 
suicidal risk persons, known depressed persons 
not at near-term suicidal risk and non-depressed 
persons from a control group, to determine whether 
the person is at near-term suicidal risk or is merely 
depressed. 



See title of the article 



(page 832, part B., second paragraph from the 
buttom: "Approximaately 2 min and 30 s of 
unedited speech ...) 



dividing signal into segments (same reference as 
above) 



measuring Fo for each segment (page 833, first 
and second paragraphs) 



Differences in Fo statistics were examined for 
control, major-depressed and high-risk (suicidal) 
groups (Table 8) 
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France et al ("Acoustical Properties of Speech as Indicators of Depres sion ") does not 

explicitly teach using "difference in measured fundamental frequency for said two segments" as 
an indication of suicidal tendencies (claim steps D and E). 

However, France et al ("Acoustical Properties of Speech as Indicators of 

Depression ") suggests measuring Fo range for each 20 second segment (2"'' paragraph, pg 

833). The range of a set of data is, the difference between the highest and lowest values in the 
set. Here, the 20 second segment would undoubtedly contains several (2 or 3) voiced segments, 
since Fo can only be measured for voiced segments and 20 seconds is too long of a duration for a 
single voiced segment (short of a long scream, but certainly no common in speech receded 
during a therapy session). As a result, the Fo range over the 20 second window would represent 
the Fo difference of two segments in case where there are only two voiced segments in the said 
20 second window, hi addition, France et al ("Acoustical Properties o f Speech as hidicators of 

Depression ") does discuss measuring jitter (fundamental frequency differences) for Fo in 

their article ("II. Previous Work", page 830). 

As a result, it would have been obvious to one of ordinary skill in the art at the time the 
invention of made to modify France et al ("Acoustical Properties of Speech as Indicators of 

Depression ") to use "difference in measured fundamental frequency for said two segments" 

(jitter) as another statistic for attempting to determine whether the person is predisposed to 
suicide, as the article of France et al ("Acoustical Properties of Speech as Indicators of 
Depression ") indirectly suggests the use of this statistic for one of ordinary skill in the art. 



Application/Control Number: 09/97 1,777 Page 1 0 

Art Unit: 2655 

Response to Arguments 

5. Applicant's arguments filed 2/9/2005 have been fully considered but they are not 
persuasive. As per the applicant's arguments that the Shubha & Parsons references do not teach 
new element K, examiner argues that these arguments are moot in view of the new grounds of 
rejection. As per the arguments against the France et al reference, examiner argues that the 
category of unintelligible voice sounds fall into the category of unvoiced speech as discussed by 
France et al. 

Conclusion 

6. Applicant's amendment necessitated the new ground(s) of rejection presented in this 
Office action. Accordingly, THIS ACTION IS MADE FINAL. See MPEP § 706.07(a). 
Applicant is reminded of the extension of time policy as set forth in 37 CFR 1.136(a). 

A shortened statutory period for reply to this final action is set to expire THREE 
MONTHS fi*om the mailing date of this action. In the event a first reply is filed within TWO 
MONTHS of the mailing date of this final action and the advisory action is not mailed until after 
the end of the THREE-MONTH shortened statutory period, then the shortened statutory period 
will expire on the date the advisory action is mailed, and any extension fee pursuant to 37 
CFR 1 . 1 36(a) will be calculated fi-om the mailing date of the advisory action. In no event, 
however, will the statutory period for reply expire later than SIX MONTHS firom the date of this 
final action. 
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Any response to this action should be mailed to: 



Commissioner of Patents and Trademarks 
Washington, D.C. 20231 
or faxed to: 
(703) 872 9314, 

(for informal or draft communications, please label "PROPOSED" or 



8. Any inquiry conceming this communication or earlier communications from the 
examiner should be directed to Michael Opsasnick, telephone number (571)272-7623, 
who is available Tuesday-Thursday, 9am-4pm. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Mr. Wayne Young, can be reached at (571)272-7582. The facsimile phone 
number for this group is (571)272-7629. 

Any inquiry of a general nature or relating to the status of this application or 
proceeding should be directed to the Group 2600 receptionist whose telephone number is 
(571) 272-2600, the 2600 Customer Service telephone number is (571)272-2600. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained fi-om either Private PAIR or Public PAIR. Status 
information for unpublished applications is available through Private PAIR only. For 
more information about the PAIR system, see http://pair-direct.uspto.gov. Should you 
have questions on access to the Private PAIR system, contact the Electronic Business 
Center (EBC) at 866-217-9197 (toll-free). ^ / 



If 



'DRAFT") 



Hand-delivered responses should be brought to Crystal Park II, 2121 
Crystal Drive, Arlington. VA., Sixth Floor (Receptionist). 
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